home
***
CD-ROM
|
disk
|
FTP
|
other
***
search
/
Resource Library: Multimedia
/
Resource Library: Multimedia.iso
/
hypertxt
/
msdos
/
montana0
/
sgmi
< prev
next >
Wrap
Text File
|
1993-03-05
|
41KB
|
764 lines
============================================================
[S8] ENTRY-LEVEL SGML PARSERS AND RELATED SGML SOFTWARE
============================================================
INTRODUCTION: No opinions, judgments, recommendations or warranties
are offered with respect to the SGML products noted below. The
selection reflects a simple, minimal goal of pointing the interested
beginner to some low-cost, entry-level (PC and Macintosh
microcomputers) SGML tools, and to public domain sources which may be
freely used or modified in experimentation. Special place is given to
ARC-SGML and derivative materials, since they have generously been
placed (via liberal license terms) into 'public domain.'
Anyone wishing to evaluate SGML software for commercial or large-scale
production purposes should follow normal industry standards in making
such evaluation. Three published resources described under section 9
below may be of assistance: (1) The SGML Source Guide [134], edited
for the GCA by Marion Elledge, supplies summary information on SGML
products and services; (2) Bibliography on SGML (Standard Generalized
Markup Language) and Related Issues [133], Cover/Duncan/Barnard,
contains 117 entries on SGML products, services, and development
projects; (3) SGML Products and Services [135], compiled for the CALS
in Europe SIG by Joan Smith, covers CALS-SGML software. A technical
report prepared by Nicholas Duncan for the TEI summarized major
companies/products in the SGML arena: Nicholas Duncan, "Companies
Producing SGML-Related Products," TEI Syntax and Metalanguage
Committee, Technical Report TEI MLP28 (October 2, 1990); the document
is available from Wendy Plotkin; see [114]. Other primary resources
are the seasonal and annual trade shows, SGML workshops, (GCA) TechDoc
conferences, (GCA) MarkUp and SGML Conferences and other electronic
publishing events. Printed resources include the trade publications
and dedicated SGML newsletters, which regularly feature new SGML
software products: CALS Report, SGML Report, SGML Users' Group
Newsletter ("New Products and Services" section), SGML Users' Group
Bulletin, CALS Journal, EuroCALS Newsletter, [TAG], EPSIG News,
Seybold Report on Publishing Systems, Release 1.0, etc. Included in
the online CALS-BBS support is a file on CALS-Related Software
Packages, maintained by the Tech Transfer Committee of the DOD/ISG
Small Business Task Group: anon-FTP to durer.cme.nist.gov
(129.6.32.4), and get the file pub/cals/bulletin/bullet51. See more at
[117].
A large proportion of serious (industry, government, academic) SGML
work is carried out on workstations, mini- and mainframe computers.
These SGML-aware editors, transducers, translators and other
facilities are numerous, and in general could not be evaluated here.
For a summary of some SGML-aware editors, see: Dale Waldt, "Overview
of SGML-Smart Text Editors," [TAG] 17 (December 1990) 12-15; he
reviews IBM TextWrite; Datalogics WriterStation; SoftQuad
Author/Editor; Yard Software Write-It; Software Exoterica CheckMark).
The parsers and other software resources noted below are these:
* ARC-SGML
-Original ARC-SGML 'VM2' Release for MS-DOS, with C-Sources and
Documentation by Charles Goldfarb
-UNIX port of ARC-SGML, by James Clark
-'SGMLS' version of ARC-SGML, by James Clark
-Macintosh port of ARC-SGML, by Wally Wedel
* Amsterdam (Free University) Parser
* Software Exoterica (XGML Validator; XGML Normalizer, XGML
Translator, XGML OmniMark)
* SEMA Group/Yard Software Systems Limited (Mark-It Light)
* SoftQuad (Author/Editor)
* E2S (EASE -- E2S Advanced SGML Editor)
* Electronic Book Technologies (MS-Windows) DynaText SGML
Browse/Search
* Avalanche Development Company (FastTag, IMSYS) - Intelligent
Autotagging
* qwertz/FORMAT -- SGML to LaTeX (and nroff/troff) Translator
* NIST "SGML parser materials"
* Public DTD Collections
[118] ARC-SGML
[119] Original ARC-SGML 'VM' Release for MS-DOS, with C-Sources and
Documentation by Charles Goldfarb
In July 1991, a significant set of SGML Parser Materials was made
available via the SGML Users Group (SGMLUG). The materials consist of
source code (in C), executable binary (for PCs), examples, and
documentation. The SGMLUG has been granted "an irrevocable, royalty-
free, worldwide, non-exclusive license to use, execute, reproduce,
display, perform and distribute copies of, and to prepare derivative
works based upon these materials; and the right to authorize others to
do any of the foregoing".
The grantor of the license to the SGMLUG, who has chosen to remain
anonymous, does not warrant the Parser Materials in any way. Dr.
Charles Goldfarb, the Honorary Technical Consultant to the SGMLUG, has
created some documentation to introduce the materials.
The SGML Project at the University of Exeter has agreed to make the
Parser Materials and the associated README file available on the
academic network, and to act as a collector of enhancements and
changes to them.
Reviews of the parser appeared on the Usenet Newsgroup comp.text.sgml
in July-August 1991 by François Chahuneau, AIS/Berger Levrault;
Michael Sperberg-McQueen University of Illinois at Chicago; David G.
Durand (dgd@cs.bu.edu) Boston University Computer Science. The Exeter
group and David Durand are keeping lists of bug reports,
modifications, etc.
If you are in the UK and have access to JANET, you may use blue book
file transfer to obtain the files from uk.ac.exeter.sgml1; use the
following address and protocol: uk.ac.exeter.sgml1 username:sgmlbox,
password:sgmlbox. If you have any problems contact sgml@uk.ac.exeter,
or phone (+44) 392 263946, or fax (+44) 392 211630. Exeter plans to
have anonymous-FTP support by the end of calendar 1991.
The summary above was adapted from a public posting by Paul Ellison to
TEI-L, 18-July-1991. Fuller description may be found in the
associated README files on the relevant file servers, and in "SGML
Parser Materials Donated to [the] SGML Users' Group," SGML Users'
Group Newsletter 20 (September 1991) 3-4.
The ARC-SGML parser materials are available via anonymous FTP from
several sites, typically under a directory ARC-SGML:
Host sgml1.ex.ac.uk (144.173.6.61)
Or uk.ac.exeter.sgml1 on JANET - see [113] above
Location /sgmlbox
Host mailer.cc.fsu.edu (128.186.6.103)
Location: /pub/sgml/ARC-SGML
Host ftp.ifi.uio.no (129.240.88.1)
Location: /pub/SIGhyper/ARC-SGML/distrib
Host src.doc.ic.ac.uk (146.169.3.7)
Location: /text/sgml
DIRECTORY rwxr-xr-x 512 Dec 13 05:03 ARC-SGML
[120] UNIX Port of ARC-SGML, by James Clark
The sources included in ARC-SGML have been modified to facilitate
compiling under UNIX (James Clark); the sources and patches are
available on several anonymous-FTP hosts. The filename is typically:
arcsgml-1.0jclark.tar.Z or something similar.
Host mailer.cc.fsu.edu (128.186.6.103)
Location: /pub/sgml/ARC-SGML.UNIX
FILE rw-r--r-- 289727 Aug 5 11:19 arcsgml-1.0jclark.tar.Z
Host src.doc.ic.ac.uk (146.169.3.7)
Location: /text/sgml/ARC-SGML.UNIX
FILE rw-r--r-- 289727 Aug 5 11:19 arcsgml-1.0jclark.tar.Z
Host nic.funet.fi (128.214.6.100)
Location: /pub/languages/sgml
FILE rw-rw-r-- 289727 Aug 4 16:17 arcsgml-1.0jclark.tar.Z
Host ftp.ifi.uio.no (129.240.88.1)
[121] 'SGMLS' Version of ARC-SGML, by James Clark
"A beta-test version of sgmls, a SGML parser derived from the ARCSGML
parser materials, is now available for anonymous ftp. . . Sgmls
outputs a simple, line oriented, ASCII representation of a SGML
document's Element Structure Information Set, which can easily be
parsed by awk, perl, C or whatever. The idea is that sgmls can be
used as the front end for a structure-controlled SGML application.
It is primarily intended for Unix systems, but it works also on MS-
DOS. I've tested it on the following architecture/OS/compiler
combinations: sparc/SunOS 4.1.1/cc, sparc/SunOS 4.1.1/gcc,
386/SVR3.2/cc, 286/MS-DOS/Borland C++ 2.0. It should be
straightforward to port to most systems that have 8-bit bytes and a
character set consistent with ISO 646 IRV.
(The beta version of sgmls (arguably) contains minor bugs which
prevent it from being fully conforming, but the author "aims to make
sgmls fully conforming to ISO 8879.")
Adapted from a posting by James Clark: jjc@jclark.com (James Clark) to
Usenet Newsgroup comp.text.sgml; Subject: sgmls available; Date: 28-
Oct-91.
Provisionally (December 1991) 'sgmls' is available on several anon-FTP
hosts:
Host ftp.ifi.uio.no (129.240.88.1)
Location: /SIGhyper/ARC-SGML/jclark
FILE r--r--r-- 757760 Oct 28 18:23 sgmls-0.3.tar
Host mailer.cc.fsu.edu (128.186.6.103)
Location: /pub/sgml/jclark
FILE rw-r--r-- 249967 Oct 29 11:35 sgmls-0.3.tar.Z
Host utsun.s.u-tokyo.ac.jp (133.11.11.11)
Location: /misc/sgml
FILE rw-r--r-- 249967 Oct 31 11:46 sgmls-0.3.tar.Z
Host src.doc.ic.ac.uk (146.169.3.7)
Location: /other3/text/sgml/jclark
FILE r--r--r-- 249967 Oct 29 11:35 sgmls-0.3.tar.Z
Host isfs.kuis.kyoto-u.ac.jp (130.54.20.1)
Location: /ftpmail/utsun.s.u-tokyo.ac.jp/misc/sgml
FILE rw-rw-r-- 249967 Dec 19 19:31 sgmls-0.3.tar.Z
Host puffin.doc.ic.ac.uk (146.169.3.7)
Location: /text/sgml/jclark
FILE r--r--r-- 249967 Oct 29 11:35 sgmls-0.3.tar.Z
[122] Macintosh Port of ARC-SGML, by Wally Wedel
According to a posting to Usenet Newsgroup comp.text.sgml by Wally
Wedel (31-December-1991), a version of ARC-SGML ported to the
Macintosh platform is available via anonymous-FTP from the site
rascal.ics.utexas.edu. The path to 'rascal' is not clearly indicated,
but the archive files are available on several other servers, as
listed below.
"I have recently finished porting ARC SGML 1.0 as released by the SGML
Users' Group to the Apple Macintosh environment. This archive is being
released to the InterNet community on an as-is, no-support basis. The
Compactor archive has been submitted to rascal.ics.utexas.edu. . ."
"ARC SGML 1.0 for the Macintosh: This archive is a port of an SGML
parser called ARC SGML to the Apple Macintosh environment. Original
sources of this parser are available from the SGML Users Group and
various FTP sites around the world. Users should read the document
LICENSE.DOC in the docs directory to determine the legal status of the
original code. My modifications impose no further restrictions on
use. This code is supplied on an as-is basis with no warranties of any
kind. . ."
"This code does contain errors. James Clark has caught and fixed some
in his SGMLS code for UNIX. There are no doubt more. Moreover this
code is not for the beginner or the timid. A copy of the SGML Handbook
by your side when working with it is highly recommended. . .This port
has been a 'spare time' project to take a close look at the
implementation. I have no plans for further enhancements in spite of
the appearance of the todo list in the docs directory."
"This archive unpacks a folder which I maintain inside my MPW Folder
hierarchy. I build the Think C version by copying the vm.9 project and
the sources from sgmlc and sgmlh into a folder named ARC SGML in my
Think C Development folder. . . A good place to start reading about
this port is the document 'Macintosh port' in the docs directory."
(Wally Wedel 12/30/91. Archive Name: ARC SGML 1.0.sea)
Contact: Wally Wedel; U S WEST Advanced Technologies; 4001 Discovery
Drive, Suite 390; Boulder, CO 80303 USA; Email (Internet):
wwedel@uswest.com; AppleLink: D5100; TEL: (1 303) 541-6052
Anonymous-ftp locations for ARC-SGML.MAC are (or were -- use ARCHIE):
Host mailer.cc.fsu.edu (128.186.6.103)
Location: /pub/sgml/ARC-SGML.MAC
FILE rw-r--r-- 461687 Jan 7 14:25 arc-sgml-10.hqx
Host sumex-aim.stanford.edu (36.44.0.6)
Location: /info-mac/app
FILE rw-r--r-- 461687 Jan 3 12:27 arc-sgml-10.hqx
Host nro.cs.athabascau.ca (131.232.1.1)
Location: /wuarchive/mirrors/info-mac/app
FILE r--r--r-- 461687 Jan 3 12:27 arc-sgml-10.hqx
Host lth.se (130.235.16.3)
Location: /pub/mac/info-mac/app
FILE rw-r--r-- 461687 Jan 5 01:18 arc-sgml-10.hqx
Host src.doc.ic.ac.uk (146.169.3.7)
Location: /text/sgml/ARC-SGML.MAC
FILE r--r--r-- 404959 Jan 7 14:25 arc-sgml-10.hqx.Z
Location: /micros/mac/sumex/app
FILE r--r--r-- 404959 Jan 3 12:27 arc-sgml-10.hqx.Z
Location: /mac/sumex/app
FILE r--r--r-- 404959 Jan 3 12:27 arc-sgml-10.hqx.Z
Location: /micros/mac/umich/etc/compsci
FILE r--r--r-- 391587 Jan 5 07:06 arcsgml1.0.cpt.hqx.Z
Location: /mac/umich/etc/compsci
FILE r--r--r-- 391587 Jan 5 07:06 arcsgml1.0.cpt.hqx.Z
Host puffin.doc.ic.ac.uk (146.169.3.7)
Location: /text/sgml/ARC-SGML.MAC
FILE r--r--r-- 404959 Jan 7 14:25 arc-sgml-10.hqx.Z
Location: /micros/mac/sumex/app
FILE r--r--r-- 404959 Jan 3 12:27 arc-sgml-10.hqx.Z
Location: /mac/sumex/app
FILE r--r--r-- 404959 Jan 3 12:27 arc-sgml-10.hqx.Z
Location: /micros/mac/umich/etc/compsci
FILE r--r--r-- 391587 Jan 5 07:06 arcsgml1.0.cpt.hqx.Z
Location: /mac/umich/etc/compsci
FILE r--r--r-- 391587 Jan 5 07:06 arcsgml1.0.cpt.hqx.Z
Host sics.se (192.16.123.90)
Location: /pub/info-mac/app
FILE rw-r--r-- 461687 Jan 8 05:14 arc-sgml-10.hqx
Host shark.mel.dit.csiro.au (144.110.16.11)
Location: /info-mac/app
FILE rw-r--r-- 461687 Jan 4 10:32 arc-sgml-10.hqx
Host ditmela.mel.dit.csiro.au (144.110.16.11)
Location: /info-mac/app
FILE rw-r--r-- 461687 Jan 4 10:32 arc-sgml-10.hqx
Host wuarchive.wustl.edu (128.252.135.4)
Location /mirrors/info-mac/app
FILE -r--r--r-- archive 461687 Jan 3 06:27 arc-sgml-10.hqx
[123] Amsterdam (Free University) Parser
A description of the Amsterdam Parser (ASP) is found in the article
cited sub section 1 above, [45]. "The Amsterdam SGML Parser uses an
LL(1) parser generator, notably LLgen, for both DTD and document
parsing. Actually, SGML is not LL(1), [so the developers] used the
'conflict resolvers' from LLgen to overcome the problems [they] came
across." Questions regarding the parser and its current status might
best be directed to the Vrije Universiteit: Prof. Dr. J.C. van Vliet,
Internet email: hans@cs.vu.nl. As a last resort, contact Jos Warmer
at PTT Research (no longer at VU): jos@pttrnl.nl OR possibly
JB_Warmer@pttrnl.nl.
The Amsterdam Parser is available via anonymous-FTP on the hosts
listed below, though support for the parser (obtained in this manner)
should not necessarily be expected.
Host star.cs.vu.nl (130.37.24.6)
DIRECTORY rwxr-xr-x 512 Aug 7 10:34 Sgml
FILE rw-r--r-- 419803 Aug 7 10:23 Sgml.tar.Z
Host nic.funet.fi (128.214.6.100)
Location: /pub/languages/sgml
FILE rw-r--r-- 419803 Aug 12 02:14 Sgml.tar.Z
Host isfs.kuis.kyoto-u.ac.jp (130.54.20.1)
Location: /ftpmail/ftp.ricoh.co.jp/pub/misc/sgml
FILE rw-rw-r-- 419803 Dec 18 17:44 Sgml.tar.Z
Host theta.iis.u-tokyo.ac.jp (130.69.48.4)
Location: /sgml
FILE rw-rw-r-- 419803 Dec 13 13:12 Sgml.tar.Z
[124] Software Exoterica - XGML Validator (XGML Normalizer, XGML
Translator, XGML OmniMark)
An entry-level validating SGML parser distributed by EPSIG and GCA for
just sixty-five (65) US dollars is Software Exoterica's Validator(tm).
XGML Validator is an application of Software Exoterica's XGML
Translator, and contains a subset of features implemented in the
fully-functional XGML Normalizer and XGML Tester. Validator is a
batch SGML parser with error detection but no entity expansion or
normalization features. XGML Normalizer also produces a version of
the document with all shortrefs and entities expanded; Tester will
generate an even more detailed version of the document in various
formats, including the format suggested by the newest ISO standard for
conformance testing. The Validator parser validates SGML
declarations, DTDs and document instances: it detects all errors in a
single pass, reports errors in a diagnostic transcript, recovers
intelligently from errors and always parses the full document.
Validator requires a PC (MS-DOS) with 386 CPU and at least 4 megabytes
of memory.
See further description in: (1) "EPSIG Sells XGML Validator," The
Seybold Report on Publishing Systems 20/9 (January 28 1991) 28; (2)
"EPSIG and GCA Selling Exoterica Validator," SGML Users' Group
Newsletter 20 (September 1991) 32-33, or (3) [TAG] 17 (December 1990)
4, or EPSIG News 3/4 (December 1990) 9. Ordering from GCA or EPSIG:
EPSIG; c/o OCLC, Mail Code 278; 6565 Frantz Road; Dublin, OH 43017-
0702; TEL: (1 614) 764-6195; FAX: (1 614) 764-6096. GCA address:
Graphic Communications Association; 100 Daingerfield Road, Alexandria
VA 22314; TEL: (1 703) 519-8160; Telex: 510-600-0889; FAX: (1 703)
548-2867.
Software Exoterica's related products include an SGML editor and
several tools for SGML translation/transduction. (1) XGML CheckMark
is a full-screen SGML editor available for the Mac II. It was
originally developed as a conversion tool, but can be used for a
variety of authoring/writing tasks. CheckMark uses actual DTDs, not
rules files, and can validate documents while they are being edited.
Tags may be entered into documents from menus or directly from the
keyboard. (2) XGML Translator is an SGML-based string transducer
which uses a string conversion language called XTRAN. Translator is
available for Unix systems and 386 DOS microcomputers. Translator's
functions are up-, down-, context- and cross-translations -- creating
various kinds of SGML [[-]] non-SGML conversions of documents and
structured information. Exoterica considers XGML Translator
appropriate for third generation applications. See Brian Travis,
"Review of the Exoterica XGML Translator," [TAG] 17 (December 1990) 5-
11. (3) XGML OmniMark is Exoterica's tool for fourth generation
applications using markup languages. OmniMark script-writing and
document translation facilities can be used to convert SGML documents
to input languages of other languages, and to convert the output
languages of other to SGML-defined languages (e.g., AAP, CALS), and to
convert between arbitrary languages and data formats. OmniMark is
available for MS-DOS (386-class machines), Microsoft Windows 3.0,
OS/2, Macintosh, SCO UNIX, DEC VAX, and most UNIX systems. See EPSIG
News 4/4 (December 1991) 5-6.
Queries about XGML Validator and the full line of Exoterica's XGML
products may be directed to: Software Exoterica Corporation; 383
Parkdale Avenue, Suite 406; Ottawa, Ontario; CANADA K1Y 4R4; TEL: 1-
613-722-1700; TEL: 1-800-565-9465 (1-800-565-XGML); FAX: 1-613-722-
5706.
[125] SEMA Group/Yard Software Systems Limited (Mark-It Light)
Mark-It Light is a "low cost" entry-level validating parser from the
SEMA Group, and runs on MS-DOS 286 and 386 class microcomputers. It
is based upon the so-called Sobemap parser, named after the European
company now known as the SEMA Group. Mark-It Light supports a subset
of features found in SEMA's fuller Mark-It package. Mark-It includes,
as part of the package, a structured document editor called Write-It,
which is equivalent to IBM's TextWrite and SoftQuad's Author/Editor.
The Mark-It parser supports SGML more completely than most other SGML
parsers: multiple concurrent documents, subdocuments, data tags,
attributes, marked sections, rank, tag omission (minimization), short
tags, short references, formal public identifiers, model inclusions,
links (simple, implicit and explicit links). Additionally, Mark-It
uses SGML together with a regular expression pattern recognizer to
control the process of converting files both into and out of SGML
form: "users can easily convert to and from TeX, including the LaTeX
and JLaTeX forms, without having to be provided with a preconfigured
TeX conversion package." Other components in the suite include
Parse-It (file creator and parser), Compile-It (SGML DTD compiler for
Write-It) and Lisp-It (an SGML-aware interpreter useful for
applications development). Compiled versions of Mark-It and related
tools are also available for UNIX and most mainframe computers, while
C source code is available to OEM developers.
The Mark-It Light parser itself thus supports basic validation: markup
using the complete range of tag minimization features (data tagging,
SGML tag ranking option), and interactive correction of markup errors;
it also provides access to the full set of 256 character codes
(extended ASCII). See: (1) "Yard Release[s] Mark-It Light," SGML
Users' Group Newsletter 18 (November 1990) 22; (2) "Mark-It Light,"
EPSIG News 4/1 (March 1991) 4; (3) Martin Bryan and Ed Warnshuis,
"Comments from our readers (letter to the .
editor)." The Seybold
Report on Publishing Systems 20/10-11 (February 25, 1991) 2-4; (4)
"Five New Products from Sema," EPSIG News 4/3 (September 1991) 7.
Contacts: In Europe, Products Manager; SEMA Group Systems Ltd.;
Avonbridge House, Bath Road, Chippenham, Wiltshire SN15 2BB; TEL +44-
249-656-194 (Paul Moorhead); FAX +44-249-655-723; or, Martin Bryan;
Yard Software Systems; 29 Oldbury Orchard; Churchdown; Glos GL3 2PU,
UNITED KINGDOM; TEL: +44 452 714029; or Griet Descheemaeker; SEMA
Group Belgium; Place du Champ de Mars 5; Boite 40;B-1050 Brussels;
BELGIUM; TEL: +32 2 508 53 23; In the US: John W. Oster II; Yard
Software USA; 125 North Main Street; Bel Air, MD 21014; TEL: 301/838-
1911; FAX: 301/838-1913.
[126] SoftQuad (Author/Editor)
Transparent SGML parsing facilities are incorporated into SoftQuad's
popular Author/Editor and related SGML products. Author/Editor (A/E)
is an SGML(-aware) text editor and word-processor used as a major
component in electronic publishing by a wide range of developers.
Author/Editor uses pre-compiled DTDs rather than directly-editable
ASCII DTD files (viz, DTDs based upon the AAP tagsets, CALS-SGML DTDS,
ATA DTDs, or other DTDs shipped with A/E). Users who wish to
customize standard DTDs or build new DTDs rely upon a second SoftQuad
product, RulesBuilder (or its batch counterpart, 'buildRules'). The
user is informed by A/E of all legal tags at any given point in the
text by means of pull-down menus, from which the tags may be selected.
Valid SGML document structure (i.e., conforming to the rules file) is
enforced because it is not possible to type SGML tags directly into
the file. Recent (1991) enhancements to A/E include a macro
programming language (Scheme, based upon LISP), a new tables editor
and graphics module. A/E interfaces with a number of other related
products to support text and graphics formatting of the SGML files
(SoftQuad Publishing Software). A/E is currently (1991) supported for
both Apple Macintosh and DOS (MS-Windows), as well as for SunOS,
Ultrix, Sun Open Windows, DEC Motif, etc. Prices (1991, with 12-month
support contract) are about 1995 dollars US (DOS), 3295 dollars US
(UNIX) and 1695 dollars US (Mac), though academic discounts will
likely be arranged. See further description: (1) "SoftQuad
Author/Editor: An SGML Context-sensitive Text Entry System." Pp. 99-
103 in Tools for Humanists, 1989. A Guidebook to the Software and
Hardware Fair Held in Conjunction with the Dynamic Text [6-9 June 1989
Toronto]. Toronto, Ontario: Centre for Computing in the Humanities,
1989; (2) "SoftQuad - New Language/Platform," SGML Users' Group
Newsletter 19 (April 1991) 21; (3) "SoftQuad Adds Table Editor," EPSIG
News 4/4 (December 1991) 4.
Contact: In North America: SoftQuad Inc.; 56 Aberfoyle Crescent, Suite
810; Toronto, Ontario; Canada M8X 2W4; TEL: +1 416 239 4801; +1-800-
387-2777; FAX: +1 416 239 7105. Email: dns@sq.com (David Slocombe).
In Europe: Open Information Technology (UK) Ltd.; Marble Arch; King
Street; Knutsford; Cheshire WA16 6HD; UNITED KINGDOM; TEL: +44 565
50021; FAX: +44 565 51093.
[127] E2S (EASE -- E2S Advanced SGML Editor)
"EASE enables the production of Basic SGML documents (see clause
15.1.1 of ISO 8879), using any PC running DOS 3.30 (or higher) and
with 550 kB free memory. It also runs on UNIX graphical workstations,
under X-windows. EASE consists of two integrated parts, the DTD-
editor and the DI-Editor, both of which offer on-line and context-
sensitive control and help in the use of SGML (syntax- oriented
editing). They also offer on-line, context-sensitive (and hypertext-
like) help in the use of the editors themselves. The user interface
is window-based, with pull-down menus for the various
options/functions -- which can be accessed not only via the mouse, but
also via the keyboard or function keys. All nine official languages
of the E.C. are supported simultaneously, and unformatted output can
be sent straight to an ASCII or PostScript printer. On the PC
version, individual copies of the software require a protection key
(dongle) to be fitted to the Centronics (parallel) port."
"The DTD-editor is used to create, edit, validate and compile DTDs for
use with the DI-editor. Inexperienced users can use options from the
pull-down menus to build DTDs, whilst others can enter the text
directly (with DTD-editor automatically picking up any syntactic
errors). Other DTDs stored in plain ASCII files, can be imported into
the DTD-editor for parsing, validation, and compilation (provided that
they conform to the requirements suitable for producing Basic SGML
documents). On the basis of the (compiled) DTD, the DI-editor
displays the elements for the minimal logical structure of a document
of that type. Users may then either edit the logical structure of the
document (all actions being checked for semantic correctness), or
select appropriate elements and swap into the text window for data
entry (and/or the entry of further valid elements). It is also
possible to import a plain text file, conforming to ISO 2022, and add
valid markup according to the DTD; the resulting document will be a
validated Basic SGML document (extended for multilinguism)."
Reference: "New SGML Editor from E2S," EPSIG News 4/3 (September 1991)
6-7.
Normal licence fees: are 1,250 dollars US (PC); 6,500 dollars US,
(workstation); University licence fees: PC: 125 dollars US + medium,
documentation, administration and shipment: 500 dollars US;
Workstation: 650 dollars US + medium, documentation, administration
and shipment: 500 dollars US. This information thanks to Caroline de
Vleeschauwer (Email: cdv%e2s@relay.EU.net, Date: Thu, 19 Dec 91)
Contact: Joost CARDOEN, Managing Director, E2S, Moutstraat 100; B-9000
Gent, Belgium, TEL: +32/91/21.03.83, FAX: +32/91/20.31.91, e-mail:
jca@e2s.be; OR Ronny Verkest, Sales Manager; E2S; Moutstraat 100; B-
9000 Gent; Belgium; TEL: +32(91)21 03 83; FAX: +32(91)20 31 91; Email:
e2s@e2s.be (Internet).
[128] Electronic Book Technologies (MS-Windows) DynaText SGML
Browse/Search
DynaText supports SGML document indexing/searching/browsing and is
implemented for MS-DOS (Windows), as well as for various UNIX
platforms. It allows users to read, query, dynamically display and
annotate electronic "books" or other structured information. DynaText
uses SGML element tags to automatically generate hyperlinks to
associated material such as diagrams, tables, and explicit cross
references; it allows users to add their own link types/behavior
through simple style sheet entries.
Electronic style sheets are held in ascii-editable files with SGML
syntax. This mechanism can be employed by users who want to create
dynamic multi-media documents. Style definitions may be used to set
the display characteristics (font type, size, color) including
visibility or suppression of each SGML element. The principle of
conditional visibility of elements (and element classes) in response
to style sheets and icon clicks permits rapid customization of
electronic books where a variety of document editions is desired.
DynaText builds a full text index of the SGML document and (unlike
other indexers that simply report occurrences within an entire
document) can report occurrences within SGML components. Hit-list
statistics for each document section provides an unprecedented level
of search precision that enables users to find terms within the
relevant sections of the document quickly. Wild cards and regular
expressions may be used in queries, eliminating the need for exact
string matches; Boolean logic (AND, OR) may also be specified. The
indexer supports synonym lists that act like special purpose thesauri
that enable access to information though a variety of synonymous
terms. This feature is especially useful in acronym-laden technical
reference manuals.
In addition to displaying SGML-structured character text, DynaText
supports a variety of popular raster formats (such as PICT, TIFF, Sun
Raster and CCITT fax formats) to facilitate capture of associated
artwork. The system also supports an open architecture for
integration with multi-media applications allowing sound, animation
and video supplements to be added to existing reference documents.
Discounts up to 80 percent are available to academic institutions. See
fuller descriptions of the product in: (1) "DynaText: Electronic Book
Engine from EBT [Electronic Book Technologies]: First to Handle any
SGML Application," Seybold Report on Publishing Systems 20/2
(September 24, 1990) 18-22; (2) Steven J. DeRose, "DynaText:
Electronic Book Indexer/Browser," EPSIG News 3/4 (December 1990) 1-2;
(3) "Electronic Book Technologies," SGML Users' Group Newsletter 20
(September 1991) 32; (4) "Electronic Book Technologies' Dynatext
(SGML compiler). Text Tools: Beyond Search and Retrieval." Release
1.0 91/7 (July 31, 1991) 15-17.
Contacts: (US): Electronic Book Technologies, Inc.; One Richmond
Square; Providence, RI 02906 USA; TEL: (401) 421-9550; FAX: (401)
421-9551; (Internet): sjd%ebt-inc@uunet.uu.net (Steven J. DeRose), or
ebt-inc!sjd@uunet.uu.net ([netnote]that's ebt-
inc!sjd@uunet.uu.net[/]); (Internet): lrr%ebt-inc@uunet.uu.net
(Louis R. Reynolds); in Europe: EBT International; 20, Pre de la
Ferme; 1261 Gingins; SWITZERLAND; TEL: +41-22-69-24-24; FAX: +41-22-
69-24-25.
[129] Avalanche Development Company (FastTag, IMSYS) Intelligent
Autotagging
Retro-fitting electronic data files held in proprietary format with
SGML tagging, or converting paper documents to SGML-tagged files is
frequently accomplished with the use of intelligent automatic markup
software. The "autotagging" software from Software Exoterica (XGML
Translator, XGML OmniMark) was noted briefly above,; see sub [124] and
compare [44]. Other products for SGML conversion/translation are
Avalanche Development's tools FastTag and IMSYS: they incorporate
autotagging software using visual recognition techniques to generate
structured output text. Using a VRE(tm) Visual Recognition Engine,
FastTag reads visual clues: (1) from an output file in the case of
electronic files or, (2) in conjunction with OCR, to convert paper-
based documents into SGML electronic file formats. FastTag reckons
with graphics objects, tables and similar text structures. Input may
be ASCII, Calera PDA files, DCA/RTF files, WordPerfect or Word files
(other wordprocessor formats supported), OCR/ICR scanner files,
DECWrite, Interleaf, and so forth; output is user-definable, but may
be specified as SGML, RTF, FrameMaker (Maker Interchange Format, MIF),
Ventura Publisher, Interleaf, Datalogics, troff, nroff, tbl and so
forth. FastTag and IMSYS are supported on MS-DOS, as well as on a
variety of UNIX, VMS and Ultrix systems. The autotagging technology
is licensed by a number of OEM developers (Xerox Information Systems
(Kurzweil), Bell Atlantic (DocuSource), Shaffstall, IBM and DEC).
See: (1) "Avalanche Development's FastTAG." Release 1.0 91/7 (July
31, 1991) 14-16; (2) "Avalanche - New Developments," SGML Users'
Group Newsletter 19 (April 1991) 18-19; (3) "Avalanche FASTTAG to
Support EMS DTDs," EPSIG News 3/2 (June 1990) 5-6 (Avalanche announces
intention to support DTDs of the EPSIG/AAP Electronic Manuscript
Standard); (4) "Systems Solutions for the '90s: In-Plant,
Commercial. [Avalanche Development]," Seybold Report on Publishing
Systems 18/14 (April 24, 1989) 26-43 (reviews Avalanche's Imsys.CALS,
based on its Visual Recognition Engine, which interprets visual cues
in the output file of word processors and generates a tagged ASCII
file. It thus provides means of updating files to CALS-compliant SGML
format).
Related products include Proof Positive (spelling, grammar and style
checking facilities for use with Interleaf and FrameMaker) and
hypertext interface facilities. Contact: Eileen Quirk, Director of
Marketing and Sales; Avalanche Development Company; 947 Walnut Street;
Boulder, CO 80302 USA; (303) TEL: 449-5032; FAX: (303) 449-3246.
Email (Internet): support@avalanche.com, sales@avalanche.com,
apps@avalanche.com.
[130] qwertz/FORMAT -- SGML to LaTeX (and nroff/troff) Translator
Version 1.1 of the qwertz SGML documenting processing system is now
available. Format is an SGML to LaTeX (and nroff/troff) translator.
It consists of: (1) SGML document type definitions for the LaTeX
document styles (articles, books, reports, letters, slides), for
BibTeX bibliographies and for Unix manual pages; (2) Programs for
translating SGML documents of the these types into LaTeX and
nroff/troff; (3) A command for extracting source code from
documentation, which is useful for a simple kind of "literate
programming".
Format allows LaTeX documents to be created using powerful and
comfortable SGML editors, such as Author/Editor, in a quasi WYSIWYG
manner. It also provides a layer of abstraction from LaTeX, allowing
documents to be more easily translated into formats for other document
processing systems, such as nroff/troff or MS-WORD.
Anonymous-FTP locations (use ARCHIE for updates):
Host gmdzi.gmd.de (129.26.1.90)
Location: /pub/gmd
FILE rw-r--r-- 959 Sep 30 11:51 sgml2latex-format.readme
FILE rw-r--r-- 1336911 Sep 30 11:51 sgml2latex-format.tar.Z
Host utsun.s.u-tokyo.ac.jp (133.11.11.11)
Location: /misc/sgml
FILE rw-r--r-- 1336911 Oct 31 11:48 sgml2latex-format.tar.Z
Host liasun3.epfl.ch (128.178.36.30)
Location: /pub/tex
FILE rw-r--r-- 1336911 Oct 23 20:28 sgml2latex-format.tar.Z
Contact: Thomas F. Gordon; German National Research Center for
Computer Science (GMD); Schloss Birlinghoven, D-5205 Sankt Augustin 1,
Germany; Internet email: thomas@gmdzi.gmd.de; phone: (+49 22441) 14-
2665. (Adapted from Usenet posting to comp.text.sgml; Subject: New
Version of Format; Date: 30-Sep-91)
[131] NIST "SGML parser materials"
The older NIST parser materials are reported to be out-of-date and
somewhat unreliable. Furthermore, questions persist concerning the
NIST's support for these code sources, and, in light of apparently
stronger current support for ODA (e.g., NIST ODA SIG in the OSI
Implementors Workshop), for SGML more generally. It would be wise to
consult with NIST and with a recognized SGML guru before spending time
building on these SGML parser tools.
Host stag.math.lsa.umich.edu (141.211.64.23)
FILE rw-r--r-- 1731 Sep 15 1990 nist-sgml.cheater
FILE rw-r--r-- 202923 Sep 15 1990 nist-sgml.tar.Z
Host nic.funet.fi (128.214.6.100)
Location: /pub/languages/sgml
FILE rw-r--r-- 1731 Aug 12 02:29 nist-sgml.cheater
FILE rw-r--r-- 202923 Aug 12 02:29 nist-sgml.tar.Z
Host theta.iis.u-tokyo.ac.jp (130.69.48.4)
Location: /sgml
FILE rw-rw-r-- 871 Dec 13 13:13 sgml.cheater.Z
FILE rw-rw-r-- 198363 Dec 13 13:15 sgml.cpio.Z
[132] Public DTD Collections
Public DTDs and entity sets for SGML are printed in the standards
documents and in most handbooks, and are usually shipped with
commercial SGML software. The nineteen (19) character entity sets
published in Annex D of the ISO 8879 SGML Standard are legally
unencumbered, and will probably become widely available in electronic
format (use ARCHIE to search for filenames like "ISOlat1," "ISOlat2,"
ISOgrk1," "ISOgrk2," "ISOgrk3," "ISOgrk4," "ISOcyr1," "ISOcyr2,"
"ISOnum," "ISOdia," "ISOpub," "ISObox," "ISOtech," "ISOamso,"
"ISOamsb," "ISOamsr," "ISOamsa," "ISOamsn," "ISOamsc," etc.).
Currently, sample DTDs in usable ASCII format may be obtained on the
academic networks from several sources:
(a) DTDs produced by the Text Encoding Initiative (TEI) are available
from the BITNET listserv(er) on UICVM. The filenames and content of
these DTDs can be expected to undergo change during 1992, but they
currently have a VM/CMS filespec of '[filename] DTD' (e.g., TEI1 DTD;
TEIHDR1 DTD; TEIWSD1 DTD; TEIBASE1 DTD; TEIFRON1 DTD; TEIBACK1 DTD;
TEILOW1 DTD; TEICRYS1 DTD; TEILING1 DTD; TEIREND1 DTD; TEIDRAM1 DTD;
TEITC1 DTD; TEITC1 DTD). A number of related files on TEI syntax are
also available (SYNTAX MEMO; OVERVIEW SYNTAX; BASICS SYNTAX; DOCUMENT
SYNTAX; SGMLDECL SYNTAX; DTD SYNTAX). Use interactive BITNET command
or mail to request the files, "get [filename filetype]" and item [114]
above for other standard BITNET LISTSERVer syntax. The DTDs may be
obtained also from the LISTSERVer in Europe (Goettingen) which hosts
MARKUP-L: LISTSERV@DGOGWDG1 on BITNET or listserv@ibm.gwdg.de on the
Internet; see [116].
(b) MAJOUR (Modular Application for Journals). The EWS (European
Workgroup on SGML) has published a DTD for the headers of scientific
articles (MAJOUR-Header DTD). In cooperation with the CERN EPS
(European Physical Society), EWS has now also produced a "body"
standard DTD for complete scientific articles. The DTDs developed by
EWS and EPS/CERN are based upon AAP article DTDs. See above entry
[110] for further details on EWS. Free copies of the MAJOUR DTD(s)
may be obtained from the STM (International Group of Scientific
Technical and Medical Publishers): contact Ms Harriet de Hoog, STM
Secretariat, Keizersgracht 462, 1016 GE Amsterdam, THE NETHERLANDS;
FAX +31-20-38-15-66. See further in "European Workshop on SGML," SGML
Users' Group Newsletter 20 (September 1991) 22. Alternately, contact
the GCA for a free copy of MAJOUR (with purchase of other materials
from the GCA's printed materials); see SGML Users' Group Newsletter 20
(September 1991) 32.
(c) A HyTime DTD (declarations set) is located on the online SIGhyper
forums: Host ftp.ifi.uio.no (129.240.88.1) and Host mailer.cc.fsu.edu
(128.186.6.103). Disk space has been secured at the University of
Oslo, Department of Informatics for a larger collection of public
DTDs: see ftp.ifi.uio.no:/pub/SGML for contributions that may be
available from that archive (e.g., for the "general" document DTD
'PUBLIC "ISO 8879:1986//DTD General Document//EN"' from ISO 8879 Annex
E as amended).
(d) Public DTDs and entity sets should be available from Exeter. One
may periodically check the online library of the Exeter SGML Project
for its collections: anon-FTP to sgml1.ex.ac.uk (144.173.6.61). See
item [113] above.
(e) The CALS-BBS forum described sub item [117] above contains several
CALS-related DTDs, usually in self-extracting DOS compressed files
(filename.exe).
(f) DTDs supporting the AAP/EPSIG manuscript standard are available
from EPSIG for 10 US dollars plus shipping; see [107]. The same files
may be accessible from file servers: use ARCHIE to search for
AAPARTCL.TXT; AAPBOOK.TXT; AAPSERL.TXT; SHORTREF.TXT or similar
filenames.
(g) The "Information Architecture" working group of the OSF
Documentation Special Interest Group (IA WG of the OSF Doc SIG) will
be releasing public DTDs during the first and second quarters of 1992.
Contact Fred Dalrymple, the Open Software Foundation; TEL: (1 617)
621-8855, or John Bowe, Email (Internet): bowe@osf.org.